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BACKGROUND OF THE INVENTION 

1 5 Field of the Invention 

The present invention relates generally to the fields of 
cellular biology and the diagnosis of neoplastic disease. More 
specifically, the present invention relates to a novel extracellular 
serine protease termed Tumor Antigen Derived Gene-14 (TADG-14). 

20 

Description of the Related Art 

Extracellular proteases have been directly associated with 
tumor growth, shedding of tumor cells and invasion of target organs. 
Individual classes of proteases are involved in, but not limited to (1) 

2 5 the digestion of stroma surrounding the initial tumor area, (2) the 

1 



digestion of the cellular adhesion molecules to allow dissociation of 
tumor cells; and (3) the invasion of the basement membrane for 
metastatic growth and the activation of both tumor growth factors 
and angiogenic factors. 
5 The prior art is deficient in the lack of effective means of 

screening to identify proteases overexpressed in carcinoma. The 
present invention fulfills this longstanding need and desire in the art. 



1 o SUMMARY OF THE INVENTION 

The present invention discloses a screening system to 
identify proteases overexpressed in carcinoma by examining PCR 
products amplified from early-stage tumors, metastatic tumors, and 
1 5 normal ovarian epithelium. 

In one embodiment of the present invention, there is 
provided a DNA encoding a TADG-14 protein selected from the group 
consisting of: (a) isolated DNA which encodes a TADG-14 protein; (b) 
isolated DNA which hybridizes to isolated DNA of (a) above and 

2 0 which encodes a TADG-14 protein; and (c) isolated DNA differing 

from the isolated DNAs of (a) and (b) above in codon sequence due to 
the degeneracy of the genetic code, and which encodes a TADG-14 
protein. 

In another embodiment of the present invention, there is 
2 5 provided a vector capable of expressing the DNA of the present 
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invention adapted for expression in a recombinant cell and regulatory 
elements necessary for expression of the DNA in the cell. 

In yet another embodiment of the present invention, 
there is provided a host cell transfected with the vector of the 
5 present invention, said vector expressing a TADG-14 protein 

In still yet another embodiment of the present 
invention, there is provided a method of detecting expression of a 
TADG-14 mRNA, comprising the steps of: (a) contacting mRNA 
obtained from the cell with the labeled hybridization probe; and (b) 
10 detecting hybridization of the probe with the mRNA. 

Other and further aspects, features, and advantages of 
the present invention will be apparent from the following description 
of the presently preferred embodiments of the invention given for 
the purpose of disclosure. 

15 

BRIEF DESCRIPTION OF THE DRAWINGS 

So that the matter in which the above-recited features, 
advantages and objects of the invention, as well as others which will 
20 become clear, are attained and can be understood in detail, more 
particular descriptions of the invention briefly summarized above 
may be had by reference to certain embodiments thereof which are 
illustrated in the appended drawings. These drawings form a part 
of the specification. It is to be noted, however, that the appended 
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drawings illustrate preferred embodiments of the invention and 
therefore are not to be considered limiting in their scope. 

Figure 1 shows a comparison of PCR products derived 
from normal and carcinoma cDNA as shown by staining in an agarose 
5 gel. Two distinct bands (lane 2) were present in the primer pair 
sense-His-antisense Asp (AS1) and multiple bands of about 500 base 
pairs are noted in the carcinoma lane for the sense-His antisense-Ser 
(AS2) primer pairs (lane 4). 

Figure 2 shows a comparison of the amino acid sequence 
1 0 of TADG-14's catalytic domains. 

Figure 3 shows the overexpression of TADG-14 in 
ovarian carcinomas . 

Figure 4 shows the TADG-14 expression in tumors and 

cell lines. 

1 5 Figure 5 shows the blots of TADG-14 expression in fetal, 

adult and ovarian carcinoma tissues. 

Figure 6 shows the complete sequence of the TADG-14 
transcript including the open reading frame and common domains. 

Figure 7 shows the homology of TADG-14 with mouse 

2 0 neuropsin. There was approximately 76% identity for the open 

reading frame and low homology outside of the open reading frame. 

Figure 8 shows the amino acid homology of TADG-14 
with mouse neuropsin. 

25 
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DETAILED DESCRIPTION OF THE INVENTION 



As used herein, the term "cDNA" shall refer to the DNA 
copy of the mRNA transcript of a gene. 

5 As used herein, the term "derived amino acid sequence" 

shall mean the amino acid sequence determined by reading the 
triplet sequence of nucleotide bases in the cDNA. 

As used herein the term "screening a library" shall refer 
to the process of using a labeled probe to check whether, under the 
10 appropriate conditions, there is a sequence complementary to the 
probe present in a particular DNA library. In addition, "screening a 
library" could be performed by PCR. 

As used herein, the term "PCR" refers to the polymerase 
chain reaction that is the subject of U.S. Patent Nos. 4,683,195 and 
15 4,683,202 to Mullis, as well as other improvements now known in 
the art. 

The TADG-14 cDNA is 1343 base pairs long (SEQ IS No: 6) 
and encoding for a 260 amino acid protein (SEQ IS No: 7). The 
availability of the TADG-14 gene opens the way for a number studies 
2 0 that can lead to various applications. For example, the TADG-14 gene 
underlies a specific human genetic disease, the cDNA can be the basis 
for a diagnostic predictive test. 

In accordance with the present invention there may be 
employed conventional molecular biology, microbiology, and 
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recombinant DNA techniques within the skill of the art. Such 
techniques are explained fully in the literature. See, e.g., Maniatis, 
Fritsch & Sambrook, "Molecular Cloning: A Laboratory Manual 
(1982); "DNA Cloning: A Practical Approach," Volumes I and II (D.N. 
5 Glover ed. 1985); "Oligonucleotide Synthesis" (MJ. Gait ed. 1984); 
"Nucleic Acid Hybridization" [B.D. Hames & S.J. Higgins eds. (1985)]; 
"Transcription and Translation" [B.D. Hames & SJ. Higgins eds. 
(1984)]; "Animal Cell Culture" [R.I. Freshney, ed. (1986)]; 
"Immobilized Cells And Enzymes" [IRL Press, (1986)]; B. Perbal, "A 

10 Practical Guide To Molecular Cloning" (1984). 

Therefore, if appearing herein, the following terms shall 
have the definitions set out below. 

The amino acid described herein are preferred to be in 
the "L" isomeric form. However, residues in the "D" isomeric form 

1 5 can be substituted for any L-amino acid residue, as long as the 
desired functional property of immunoglobulin-binding is retained 
by the polypeptide. NH2 refers to the free amino group present at 
the amino terminus of a polypeptide. COOH refers to the free 
carboxy group present at the carboxy terminus of a polypeptide. In 

2 0 keeping with standard polypeptide nomeclature, / Biol. Chem., 
243:3552-59 (1969), abbreviations for amino acid residues are 
shown in the following Table of Correspondence: 
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TABLE OF CORRESPONDENCE 



SYMBOL 




AMINO ACID 


1-Letter 


3-Letter 




Y 


Tyr 


tyrosine 


G 


Gly 


glycine 


F 


Phe 


Phenylalanine 


M 


Met 


methionine 


A 


Ala 


alanine 


S 


Ser 


serine 


I 


He 


isoleucine 


L 


Leu 


leucine 


T 


Thr 


threonine 


V 


Val 


valine 


P 


Pro 


proline 


K 


Lys 


lysine 


H 


His 


histidine 


Q 


Gin 


glutamine 


E 


Glu 


glutamic acid 


W 


Trp 


tryptophan 


R 


Arg 


arginine 


D 


Asp 


aspartic acid 


N 


Asn 


asparagine 


C 


Cys 


cysteine 



25 

It should be noted that all amino-acid residue sequences 
are represented herein by formulae whose left and right orientation 
is in the conventional direction of amino-terminus to carboxy- 
terminus. Furthermore, it should be noted that a dash at the 
3 0 beginning or end of an amino acid residue sequence indicates a 
peptide bond to a further sequence of one or more amino-acid 
residues. The above Table is presented to correlate the three-letter 
and one-letter notations which may appear alternately herein. 
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A "replicon" is any genetic element (e.g., plasmid, 
chromosome, virus) that functions as an autonomous unit of DNA 
replication in vivo; i.e., capable of replication under its own control. 

A "vector" is a replicon, such as plasmid, phage or cosmid, 
5 to which another DNA segment may be attached so as to bring about 
the replication of the attached segment. 

A "DNA molecule" refers to the polymeric form of 
deoxyribonucleotides (adenine, guanine, thymine, or cytosine) in its 
either single stranded form, or a double-stranded helix. This term 

1 0 refers only to the primary and secondary structure of the molecule, 
and does not limit it to any particular tertiary forms. Thus, this term 
includes double- stranded DNA found, inter alia, in linear DNA 
molecules (e.g., restriction fragments), viruses, plasmids, and 
chromosomes. In discussing the structure herein according to the 

1 5 normal convention of giving only the sequence in the 5' to 3' 
direction along the nontranscribed strand of DNA (i.e., the strand 
having a sequence homologous to the mRNA). 

An "origin of replication" refers to those DNA sequences 
that participate in DNA synthesis. 

2 0 A DNA "coding sequence" is a double-stranded DNA 

sequence which is transcribed and translated into a polypeptide in 
vivo when placed under the control of appropriate regulatory 
sequences. The boundaries of the coding sequence are determined 
by a start codon at the 5' (amino) terminus and a translation stop 

2 5 codon at the 3' (carboxyl) terminus. A coding sequence can include, 
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but is not limited to, prokaryotic sequences, cDNA from eukaryotic 
mRNA, genomic DNA sequences from eukaryotic (e.g., mammalian) 
DNA, and even synthetic DNA sequences. A polyadenylation signal 
and transcription termination sequence will usually be located 3' to 
5 the coding sequence. 

Transcriptional and translational control sequences are 
DNA regulatory sequences, such as promoters, enhancers, 
polyadenylation signals, terminators, and the like, that provide for 
the expression of a coding sequence in a host cell. 

10 A "promoter sequence" is a DNA regulatory region 

capable of binding RNA polymerase in a cell and initiating 
transcription of a downstream (3' direction) coding sequence. For 
purposes of defining the present invention, the promoter sequence is 
bounded at its 3' terminus by the transcription initiation site and 

15 extends upstream (5' direction) to include the minimum number of 
bases or elements necessary to initiate transcription at levels 
detectable above background. Within the promoter sequence will be 
found a transcription initiation site, as well as protein binding 
domains (consensus sequences) responsible for the binding of RNA 

2 0 polymerase. Eukaryotic promoters often, but not always, contain 
"TATA" boxes and "CAT" boxes. Prokaryotic promoters contain 
Shine-Dalgarno sequences in addition to the -10 and -35 consensus 
sequences. 

An "expression control sequence" is a DNA sequence that 
2 5 controls and regulates the transcription and translation of another 
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DNA sequence. A coding sequence is "under the control" of 
transcriptional and translational control sequences in a cell when 
RNA polymerase transcribes the coding sequence into mRNA, which 
is then translated into the protein encoded by the coding sequence. 
5 A "signal sequence" can be included near the coding 

sequence. This sequence encodes a signal peptide, N-terminal to the 
polypeptide, that communicates to the host cell to direct the 
polypeptide to the cell surface or secrete the polypeptide into the 
media, and this signal peptide is clipped off by the host cell before 

10 the protein leaves the cell. Signal sequences can be found associated 
with a variety of proteins native to prokaryotes and eukaryotes. 

The term "oligonucleotide", as used herein in referring to 
the probe of the present invention, is defined as a molecule 
comprised of two or more ribonucleotides, preferably more than 

1 5 three. Its exact size will depend upon many factors which, in turn, 
depend upon the ultimate function and use of the oligonucleotide. 

The term "primer" as used herein refers to an 
oligonucleotide, whether occurring naturally as in a purified 
restriction digest or produced synthetically, which is capable of 

2 0 acting as a point of initiation of synthesis when placed under 
conditions in which synthesis of a primer extension product, which is 
complementary to a nucleic acid strand, is induced, i.e., in the 
presence of nucleotides and an inducing agent such as a DNA 
polymerase and at a suitable temperature and pH. The primer may 

2 5 be either single-stranded or double-stranded and must be 
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sufficiently long to prime the synthesis of the desired extension 
product in the presence of the inducing agent. The exact length of 
the primer will depend upon many factors, including temperature, 
source of primer and use the method. For example, for diagnostic 
5 applications, depending on the complexity of the target sequence, the 
oligonucleotide primer typically contains 15-25 or more nucleotides, 
although it may contain fewer nucleotides. 

The primers herein are selected to be "substantially" 
complementary to different strands of a particular target DNA 

10 sequence. This means that the primers must be sufficiently 
complementary to hybridize with their respective strands. 
Therefore, the primer sequence need not reflect the exact sequence 
of the template. For example, a non-complementary nucleotide 
fragment may be attached to the 5' end of the primer, with the 

1 5 remainder of the primer sequence being complementary to the 
strand. Alternatively, non-complementary bases or longer sequences 
can be interspersed into the primer, provided that the primer 
sequence has sufficient complementarity with the sequence or 
hybridize therewith and thereby form the template for the synthesis 

2 0 of the extension product. 

As used herein, the terms "restriction endonucleases" and 
"restriction enzymes" refer to enzymes, each of which cut double- 
stranded DNA at or near a specific nucleotide sequence. 

A cell has been "transformed" by exogenous or 

2 5 heterologous DNA when such DNA has been introduced inside the 



cell. The transforming DNA may or may not be integrated 
(covalently linked) into the genome of the cell. In prokaryotes, 
yeast, and mammalian cells for example, the transforming DNA may 
be maintained on an episomal element such as a plasmid. With 
5 respect to eukaryotic cells, a stably transformed cell is one in which 
the transforming DNA has become integrated into a chromosome so 
that it is inherited by daughter cells through chromosome replication. 
This stability is demonstrated by the ability of the eukaryotic cell to 
establish cell lines or clones comprised of a population of daughter 

10 cells containing the transforming DNA. A "clone" is a population of 
cells derived from a single cell or ancestor by mitosis. A "cell line" is 
a clone of a primary cell that is capable of stable growth in vitro for 
many generations. 

Two DNA sequences are "substantially homologous" when 

15 at least about 75% (preferably at least about 80%, and most 
preferably at least about 90% or 95%) of the nucleotides match over 
the defined length of the DNA sequences. Sequences that are 
substantially homologous can be identified by comparing the 
sequences using standard software available in sequence data banks, 

20 or in a Southern hybridization experiment under, for example, 
stringent conditions as defined for that particular system. Defining 
appropriate hybridization conditions is within the skill of the art. 
See, e.g., Maniatis et al., supra; DNA Cloning, Vols. I & II, supra; 
Nucleic Acid Hybridization, supra. 
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A "heterologous' region of the DNA construct is an 
identifiable segment of DNA within a larger DNA molecule that is not 
found in association with the larger molecule in nature. Thus, when 
the heterologous region encodes a mammalian gene, the gene will 
5 usually be flanked by DNA that does not flank the mammalian 
genomic DNA in the genome of the source organism. In another 
example, coding sequence is a construct where the coding sequence 
itself is not found in nature (e.g., a cDNA where the genomic coding 
sequence contains introns, or synthetic sequences having codons 

10 different than the native gene). Allelic variations or naturally- 
occurring mutational events do not give rise to a heterologous region 
of DNA as defined herein. 

The labels most commonly employed for these studies 
are radioactive elements, enzymes, chemicals which fluoresce when 

15 exposed to untraviolet light, and others. A number of fluorescent 
materials are known and can be utilized as labels. These include, for 
example, fluorescein, rhodamine, auramine, Texas Red, AMCA blue 
and Lucifer Yellow. A particular detecting material is anti-rabbit 
antibody prepared in goats and conjugated with fluorescein through 

2 0 an isothiocyanate. 

Proteins can also be labeled with a radioactive element or 
with an enzyme. The radioactive label can be detected by any of the 
currently available counting procedures. The preferred isotope may 
be selected from W, 14 C, 32 P , 35 S , 36 C 1, 5i Cr , 57 Co , 58 Co , 59 Fe , 90 Y , 1251, 

2 5 1311, and ™<>Re. 
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Enzyme labels are likewise useful, and can be detected by 
any of the presently utilized colorimetric, spectrophotometric, 
fluorospectrophotometric, amperometric or gasometric techniques. 
The enzyme is conjugated to the selected particle by reaction with 
5 bridging molecules such as carbodiimides, diisocyanates, 
glutaraldehyde and the like. Many enzymes which can be used in 
these procedures are known and can be utilized. The preferred are 
peroxidase, p-glucuronidase, p-D-glucosidase, (3-D-galactosidase, 
urease, glucose oxidase plus peroxidase and alkaline phosphatase. 
10 U.S. Patent Nos. 3,654,090, 3,850,752, and 4,016,043 are referred to 
by way of example for their disclosure of alternate labeling material 
and methods. 

A particular assay system developed and utilized in the 
art is known as a receptor assay. In a receptor assay, the material to 

15 be assayed is appropriately labeled and then certain cellular test 
colonies are inoculated with a quantitiy of both the label after which 
binding studies are conducted to determine the extent to which the 
labeled material binds to the cell receptors. In this way, differences 
in affinity between materials can be ascertained. 

2 0 An assay useful in the art is known as a "cis/trans" assay. 

Briefly, this assay employs two genetic constructs, one of which is 
typically a plasmid that continually expresses a particular receptor of 
interest when transfected into an appropriate cell line, and the 
second of which is a plasmid that expresses a reporter such as 

2 5 luciferase, under the control of a receptor/ligand complex. Thus, for 
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example, if it is desired to evaluate a compound as a ligand for a 
particular receptor, one of the plasmids would be a construct that 
results in expression of the receptor in the chosen cell line, while the 
second plasmid would possess a promoter linked to the luciferase 
5 gene in which the response element to the particular receptor is 
inserted. If the compound under test is an agonist for the receptor, 
the ligand will complex with the receptor, and the resulting complex 
will bind the response element and initiate transcription of the 
luciferase gene. The resulting chemiluminescence is then measured 
1 0 photometrically, and dose response curves are obtained and 
compared to those of known ligands. The foregoing protocol is 
described in detail in U.S. Patent No. 4,981,784. 

As used herein, the term "host" is meant to include not 
only prokaryotes but also eukaryotes such as yeast, plant and animal 

1 5 cells. A recombinant DNA molecule or gene which encodes a human 

TADG-14 protein of the present invention can be used to transform a 
host using any of the techniques commonly known to those of 
ordinary skill in the art. Especially preferred is the use of a vector 
containing coding sequences for the gene which encodes a human 

2 0 TADG-14 protein of the present invention for purposes of prokaryote 

transformation. 

Prokaryotic hosts may include E. coli, S. tymphimurium, 
Serratia marcescens and Bacillus subtilis. Eukaryotic hosts include 
yeasts such as Pichia pastoris, mammalian cells and insect cells. 
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In general, expression vectors containing promoter 
sequences which facilitate the efficient transcription of the inserted 
DNA fragment are used in connection with the host. The expression 
vector typically contains an origin of replication, promoter(s), 
5 terminator(s), as well as specific genes which are capable of 
providing phenotypic selection in transformed cells. The 
transformed hosts can be fermented and cultured according to means 
known in the art to achieve optimal cell growth. 

The invention includes a substantially pure DNA 

10 encoding a TADG-14 protein, a strand of which DNA will hybridize at 
high stringency to a probe containing a sequence of at least 15 
consecutive nucleotides of (SEQ ID NO:6). The protein encoded by 
the DNA of this invention may share at least 80% sequence identity 
(preferably 85%, more preferably 90%, and most preferably 95%) 

15 with the amino acids listed in Figure 6 (SEQ ID NO: 7). More 
preferably, the DNA includes the coding sequence of the nucleotides 
of Figure 6 (SEQ ID NO:6), or a degenerate variant of such a 
sequence. 

The probe to which the DNA of the invention hybridizes 
2 0 preferably consists of a sequence of at least 20 consecutive 
nucleotides, more preferably 40 nucleotides, even more preferably 
50 nucleotides, and most preferably 100 nucleotides or more (up to 
100%) of the coding sequence of the nucleotides listed in Figure 6 
(SEQ ID NO: 6) or the complement thereof. Such a probe is useful for 
2 5 detecting expression of TADG-14 in a human cell by a method 
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including the steps of (a) contacting mRNA obtained from the cell 
with the labeled hybridization probe; and (b) detecting 
hybridization of the probe with the mRNA. 

This invention also includes a substantially pure DNA 
5 containing a sequence of at least 15 consecutive nucleotides 
(preferably 20, more preferably 30, even more preferably 50, and 
most preferably all) of the region from nucleotides 1 to 1343 of the 
nucleotides listed in Figure 6 (SEQ ID NO: 6). 

By "high stringency" is meant DNA hybridization and 

1 0 wash conditions characterized by high temperature and low salt 
concentration, e.g., wash conditions of 65°C at a salt concentration of 
approximately 0.1 x SSC, or the functional equivalent thereof. For 
example, high stringency conditions may include hybridization at 
about 42°C in the presence of about 50% formamide; a first wash at 

15 about 65°C with about 2 x SSC containing 1% SDS; followed by a 
second wash at about 65°C with about 0.1 x SSC. 

By "substantially pure DNA" is meant DNA that is not 
part of a milieu in which the DNA naturally occurs, by virtue of 
separation (partial or total purification) of some or all of the 

2 0 molecules of that milieu, or by virtue of alteration of sequences that 
flank the claimed DNA. The term therefore includes, for example, a 
recombinant DNA which is incorporated into a vector, into an 
autonomously replicating plasmid or virus, or into the genomic DNA 
of a prokaryote or eukaryote; or which exists as a separate molecule 

2 5 (e.g., a cDNA or a genomic or cDNA fragment produced by 
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polymerase chain reaction (PCR) or restriction endonuclease 
digestion) independent of other sequences. It also includes a 
recombinant DNA which is part of a hybrid gene encoding additional 
polypeptide sequence, e.g., a fusion protein. Also included is a 
5 recombinant DNA which includes a portion of the nucleotides listed 
in Figure 6 (SEQ ID NO: 6) which encodes an alternative splice 
variant of TADG-14. 

The DNA may have at least about 70% sequence identity 
to the coding sequence of the nucleotides listed in Figure 6 
10 (SEQIDNO:6), preferably at least 75% (e.g. at least 80%); and most 
preferably at least 90%. The identity between two sequences is a 
direct function of the number of matching or identical positions. 
When a subunit position in both of the two sequences is occupied by 
the same monomeric subunit, e.g., if a given position is occupied by 

1 5 an adenine in each of two DNA molecules, then they are identical at 

that position. For example, if 7 positions in a sequence 
10 nucleotides in length are identical to the corresponding positions 
in a second 10-nucleotide sequence, then the two sequences have 
70% sequence identity. The length of comparison sequences will 

2 0 generally be at least 50 nucleotides, preferably at least 60 

nucleotides, more preferably at least 75 nucleotides, and most 
preferably 100 nucleotides. Sequence identity is typically 
measured using sequence analysis software (e.g., Sequence Analysis 
Software Package of the Genetics Computer Group, University of 
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Wisconsin Biotechnology Center, 1710 University Avenue, Madison, 
WI 53705). 

The present invention comprises a vector comprising a 
DNA sequence coding for a which encodes a human TADG-14 protein 
5 and said vector is capable of replication in a host which comprises, 
in operable linkage: a) an origin of replication; b) a promoter; and c) 
a DNA sequence coding for said protein. Preferably, the vector of 
the present invention contains a portion of the DNA sequence shown 
in SEQ ID No: 6. A "vector" may be defined as a replicable nucleic 
1 0 acid construct, e.g., a plasmid or viral nucleic acid. Vectors may be 
used to amplify and/or express nucleic acid encoding TADG-14 
protein. An expression vector is a replicable construct in which a 
nucleic acid sequence encoding a polypeptide is operably linked to 
suitable control sequences capable of effecting expression of the 

1 5 polypeptide in a cell. The need for such control sequences will vary 

depending upon the cell selected and the transformation method 
chosen. Generally, control sequences include a transcriptional 
promoter and/or enhancer, suitable mRNA ribosomal binding sites, 
and sequences which control the termination of transcription and 

2 0 translation. Methods which are well known to those skilled in the 

art can be used to construct expression vectors containing 
appropriate transcriptional and translational control signals. See for 
example, the techniques described in Sambrook et al., 1989, 
Molecular Cloning: A Laboratory Manual (2nd Ed.), Cold Spring 
2 5 Harbor Press, N.Y. A gene and its transcription control sequences 
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are defined as being "operably linked" if the transcription control 
sequences effectively control the transcription of the gene. Vectors 
of the invention include, but are not limited to, plasmid vectors and 
viral vectors. Preferred viral vectors of the invention are those 
5 derived from retroviruses, adenovirus, adeno-associated virus, SV40 
virus, or herpes viruses. 

By a "substantially pure protein" is meant a protein 
which has been separated from at least some of those components 
which naturally accompany it. Typically, the protein is substantially 

10 pure when it is at least 60%, by weight, free from the proteins and 
other naturally-occurring organic molecules with which it is 
naturally associated in vivo. Preferably, the purity of the 
preparation is at least 75%, more preferably at least 90%, and most 
preferably at least 99%, by weight. A substantially pure TADG-14 

1 5 protein may be obtained, for example, by extraction from a natural 
source; by expression of a recombinant nucleic acid encoding an 
TADG-14 polypeptide; or by chemically synthesizing the protein. 
Purity can be measured by any appropriate method, e.g., column 
chromatography such as immunoaffinity chromatography using an 

2 0 antibody specific for TADG-14, polyacrylamide gel electrophoresis, 
or HPLC analysis. A protein is substantially free of naturally 
associated components when it is separated from at least some of 
those contaminants which accompany it in its natural state. Thus, a 
protein which is chemically synthesized or produced in a cellular 

25 system different from the cell from which it naturally originates will 
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be, by definition, substantially free from its naturally associated 
components. Accordingly, substantially pure proteins include 
eukaryotic proteins synthesized in E. coli, other prokaryotes, or any 
other organism in which they do not naturally occur. 
5 In addition to substantially full-length proteins, the 

invention also includes fragments (e.g., antigenic fragments) of the 
TADG-14 protein (SEQ ID No: 7). As used herein, "fragment," as 
applied to a polypeptide, will ordinarily be at least 10 residues, 
more typically at least 20 residues, and preferably at least 30 (e.g., 

1 0 50) residues in length, but less than the entire, intact sequence. 
Fragments of the TADG-14 protein can be generated by methods 
known to those skilled in the art, e.g., by enzymatic digestion of 
naturally occurring or recombinant TADG-14 protein, by 
recombinant DNA techniques using an expression vector that 

15 encodes a defined fragment of TADG-14, or by chemical synthesis. 
The ability of a candidate fragment to exhibit a characteristic of 
TADG-14 (e.g., binding to an antibody specific for TADG-14) can be 
assessed by methods described herein. Purified TADG-14 or 
antigenic fragments of TADG-14 can be used to generate new 

2 0 antibodies or to test existing antibodies (e.g., as positive controls in a 
diagnostic assay) by employing standard protocols known to those 
skilled in the art. Included in this invention are polyclonal antisera 
generated by using TADG-14 or a fragment of TADG-14 as the 
immunogen in, e.g., rabbits. Standard protocols for monoclonal and 

2 5 polyclonal antibody production known to those skilled in this art are 
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employed. The monoclonal antibodies generated by this procedure 
can be screened for the ability to identify recombinant TADG-14 
cDNA clones, and to distinguish them from known cDNA clones. 

Further included in this invention are TADG-14 proteins 
5 which are encoded at least in part by portions of SEQ ID NO: 
7, e.g., products of alternative mRNA splicing or alternative protein 
processing events, or in which a section of TADG-14 sequence has 
been deleted. The fragment, or the intact TADG-14 polypeptide, 
may be covalently linked to another polypeptide, e.g. which acts as a 

1 0 label, a ligand or a means to increase antigenicity. 

The invention also includes a polyclonal or monoclonal 
antibody which specifically binds to TADG-14. The invention 
encompasses not only an intact monoclonal antibody, but also an 
immunologically-active antibody fragment, e.g., a Fab or (Fab)2 

1 5 fragment; an engineered single chain Fv molecule; or a chimeric 
molecule, e.g., an antibody which contains the binding specificity of 
one antibody, e.g., of murine origin, and the remaining portions of 
another antibody, e.g., of human origin. 

In one embodiment, the antibody, or a fragment thereof, 

2 0 may be linked to a toxin or to a detectable label, e.g. a radioactive 

label, non-radioactive isotopic label, fluorescent label, 
chemiluminescent label, paramagnetic label, enzyme label, or 
colorimetric label. Examples of suitable toxins include diphtheria 
toxin, Pseudomonas exotoxin A, ricin, and cholera toxin. Examples 
2 5 of suitable enzyme labels include malate hydrogenase, 
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staphylococcal nuclease, delta-5-steroid isomerase, alcohol 
dehydrogenase, alpha-glycerol phosphate dehydrogenase, triose 
phosphate isomerase, peroxidase, alkaline phosphatase, 
asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, 
5 urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase, 
acetylcholinesterase, etc. Examples of suitable radioisotopic labels 
include 3 H , ^l, ^I, 32 P , 35 s> 14 Cj etc . 

Paramagnetic isotopes for purposes of in vivo diagnosis 
can also be used according to the methods of this invention. There 

1 0 are numerous examples of elements that are useful in magnetic 
resonance imaging. For discussions on in vivo nuclear magnetic 
resonance imaging, see, for example, Schaefer et al., (1989) JACC 14, 
472-480; Shreve et al., (1986) Magn. Reson. Med. 3, 336-340; Wolf, 
G. L., (1984) Physiol Chem. Phys. Med. NMR 16, 93-95; Wesbey 

15 etal., (1984) Physiol. Chem. Phys. Med. NMR 16, 145-155; Runge et 
al., (1984) Invest. Radiol. 19, 408-415. Examples of suitable 
fluorescent labels include a fluorescein label, an isothiocyalate label, 
a rhodamine label, a phycoerythrin label, a phycocyanin label, an 
allophycocyanin label, an ophthaldehyde label, a fluorescamine 

2 0 label, etc. Examples of chemiluminescent labels include a luminal 
label, an isoluminal label, an aromatic acridinium ester label, an 
imidazole label, an acridinium salt label, an oxalate ester label, a 
luciferin label, a luciferase label, an aequorin label, etc. 

Those of ordinary skill in the art will know of other 

2 5 suitable labels which may be employed in accordance with the 
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present invention. The binding of these labels to antibodies or 
fragments thereof can be accomplished using standard techniques 
commonly known to those of ordinary skill in the art. Typical 
techniques are described by Kennedy et al., (1976) Clin. Chim. Acta 
5 70, 1-31; and Schurs et al., (1977) Clin. Chim. Acta 81, 1-40. 
Coupling techniques mentioned in the latter are the glutaraldehyde 
method, the periodate method, the dimaleimide method, the m- 
maleimidobenzyl-N-hydroxy-succinimide ester method. All of these 
methods are incorporated by reference herein. 

1 0 Also within the invention is a method of detecting 

TADG-14 protein in a biological sample, which includes the steps of 
contacting the sample with the labelled antibody, e.g., radioactively 
tagged antibody specific for TADG-14, and determining whether the 
antibody binds to a component of the sample. 

15 As described herein, the invention provides a number of 

diagnostic advantages and uses. For example, the TADG-14 protein 
is useful in diagnosing cancer in different tissues since this protein 
is absent in highly proliferating cells. Antibodies (or antigen- 
binding fragments thereof) which bind to an epitope specific for 

2 0 TADG-14, are useful in a method of detecting TADG-14 protein in a 
biological sample for diagnosis of cancerous or neoplastic 
transformation. This method includes the steps of obtaining a 
biological sample (e.g., cells, blood, plasma, tissue, etc.) from a 
patient suspected of having cancer, contacting the sample with a 

2 5 labelled antibody (e.g., radioactively tagged antibody) specific for 
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TADG-14, and detecting the TADG-14 protein using standard 
immunoassay techniques such as an ELISA. Antibody binding to the 
biological sample indicates that the sample contains a component 
which specifically binds to an epitope within TADG-14. 
5 Likewise, a standard Northern blot assay can be used to 

ascertain the relative amounts of TADG-14 mRNA in a cell or tissue 
obtained from a patient suspected of having cancer, in accordance 
with conventional Northern hybridization techniques known to 
those persons of ordinary skill in the art. This Northern assay uses 
10 a hybridization probe, e.g. radiolabeled TADG-14 cDNA, either 
containing the full-length, single stranded DNA having a sequence 
complementary to SEQ ID NO: 6 (Figure 6), or a fragment of that DNA 
sequence at least 20 (preferably at least 30, more preferably at 
least 50, and most preferably at least 100 consecutive nucleotides in 

1 5 length). The DNA hybridization probe can be labelled by any of the 

many different methods known to those skilled in this art. 

Antibodies to the TADG-14 protein can be used in an 
immunoassay to detect increased levels of TADG-14 protein 
expression in tissues suspected of neoplastic transformation. These 

2 0 same uses can be achieved with Northern blot assays and analyses. 

The present invention is directed to DNA encoding a 
TADG-14 protein selected from the group consisting of: (a) isolated 
DNA which encodes a TADG-14 protein; (b) isolated DNA which 
hybridizes to isolated DNA of (a) above and which encodes a TADG- 
2 5 14 protein; and (c) isolated DNA differing from the isolated DNAs of 
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(a) and (b) above in codon sequence due to the degeneracy of the 
genetic code, and which encodes a TADG-14 protein. Preferably, the 
DNA has the sequence shown in SEQ ID No. 6. More preferably, the 
DNA encodes a TADG-14 protein having the amino acid sequence 
5 shown in SEQ ID No. 7. 

The present invention is also directed to a vector capable 
of expressing the DNA of the present invention adapted for 
expression in a recombinant cell and regulatory elements necessary 
for expression of the DNA in the cell. Preferably, the vector contains 
1 0 DNA encoding a TADG-14 protein having the amino acid sequence 
shown in SEQ ID No. 7. 

The present invention is also directed to a host cell 
transfected with the vector described herein, said vector expressing 
a TADG-14 protein. Representative host cells include consisting of 

1 5 bacterial cells, mammalian cells and insect cells. 

The present invention is also directed to a isolated and 
purified TADG-14 protein coded for by DNA selected from the group 
consisting of: (a) isolated DNA which encodes a TADG-14 protein; (b) 
isolated DNA which hybridizes to isolated DNA of (a) above and 

2 0 which encodes a TADG-14 protein; and (c) isolated DNA differing 

from the isolated DNAs of (a) and (b) above in codon sequence due to 
the degeneracy of the genetic code, and which encodes a TADG-14 
protein. Preferably, the isolated and purified TADG-14 protein of 
claim 9 having the amino acid sequence shown in SEQ ID No. 7. 
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The present invention is also directed to a method of 
detecting expression of the protein of claim 1, comprising the steps 
of: (a) contacting mRNA obtained from the cell with the labeled 
hybridization probe; and (b) detecting hybridization of the probe 

5 with the mRNA. 

The following examples are given for the purpose of 
illustrating various embodiments of the invention and are not meant 
to limit the present invention in any fashion. 

10 EXAMPLE 1 

Tissue collection and storage 

Upon patient hysterectomy, bilateral salpingo- 
oophorectomy, or surgical removal of neoplastic tissue, the specimen 

15 is retrieved and placed it on ice. The specimen was then taken to the 
resident pathologist for isolation and identification of specific tissue 
samples. Finally, the sample was frozen in liquid nitrogen, logged 
into the laboratory record and stored at -80°C. Additional specimens 
were frequently obtained from the Cooperative Human Tissue 

2 0 Network (CHTN). These samples were prepared by the CHTN and 
shipped to us on dry ice. Upon arrival, these specimens were logged 
into the laboratory record and stored at -80°C. 

25 
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EXAMPLE 2 



mRNA isolation and cDNA synthesis 

Messenger RNA (mRNA) isolation was performed 
5 according to the manufacturer's instructions using the Mini 
RiboSepTM Ultra mRNA isolation kit purchased from Becton 
Dickinson (cat. # 30034). This was an oligo(dt) chromatography 
based system of mRNA isolation. The amount of mRNA recovered 
was quantitated by UV spectrophotometry. 
10 First strand complementary DNA (cDNA) was synthesized 

using 5.0 mg of mRNA and either random hexamer or oligo(dT) 
primers according to the manufacturer's protocol utilizing a first 
strand synthesis kit obtained from Clontech (cat.# K1402-1). The 
purity of the cDNA was evaluated by PCR using primers specific for 

1 5 the p53 gene. These primers span an intron such that pure cDNA can 

be distinguished from cDNA that is contaminated with genomic DNA. 

EXAMPLE 3 

2 0 PCR reactions 

Reactions were carried out as follows: first strand cDNA 
generated from 50 ng of mRNA will be used as template in the 
presence of 1.0 mM MgC12, 0.2 mM dNTPs, 0.025 U Taq 
polymerase/ml of reaction, and lx buffer supplied with enzyme. In 
2 5 addition, primers must be added to the PCR reaction. Degenerate 
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primers which may amplify a variety of cDNAs are used at a final 
concentration of 2.0 mM each, whereas primers which amplify 
specific cDNAs are added to a final concentration of 0.2 mM each. 

After initial denaturation at 95°C for 3 minutes, thirty 
5 cycles of PCR are carried out in a Perkin Elmer Gene Amp 2400 
thermal cycler. Each cycle consists of 30 seconds of denaturation at 
95°C, 30 seconds of primer annealing at the appropriate annealing 
temperature*, and 30 seconds of extension at 72°C. The final cycle 
will be extended at 72°C for 7 minutes. To ensure that the reaction 

10 succeeded, a fraction of the mixture will be electrophoresed through 
a 2% agarose/TAE gel stained with ethidium bromide(final 
concentration 1 mg/ml). The annealing temperature varies according 
to the primers that are used in the PCR reaction. For the reactions 
involving degenerate primers, an annealing temperature of 48°C 

1 5 were used. The appropriate annealing temperature for the TADG14 
and P-tubulin specific primers is 62°C. 

EXAMPLE 4 

2 0 T-vector ligation and transformations 

The purified PCR products are ligated into the Promega T- 
vector plasmid and the ligation products are used to transform 
JM109 competent cells according to the manufacturer's instructions 
(Promega cat. #A3610). Positive colonies were cultured for 

2 5 amplification, the plasmid DNA isolated by means of the WizardTM 
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Minipreps DNA purification system (Promega cat #A7500), and the 
plasmids were digested with Apal and Sad restriction enzymes to 
determine the size of the insert. Plasmids with inserts of the size(s) 
visualized by the previously described PCR product gel 
5 electrophoresis were sequenced. 

EXAMPLE 5 

DNA sequencing 

1 0 Utilizing a plasmid specific primer near the cloning site, 

sequencing reactions were carried out using PRISMTM Ready 
Reaction Dye DeoxyTM terminators (Applied Biosystems cat# 
401384) according to the manufacturer's instructions. Residual dye 
terminators were removed from the completed sequencing reaction 

15 using a Centri-sepTM spin column (Princeton Separation cat.# CS- 
901). An Applied Biosystems Model 373 A DNA Sequencing System 
was available and was used for sequence analysis. Based upon the 
determined sequence, primers that specifically amplify the gene of 
interest were designed and synthesized. 

20 

EXAMPLE 6 

Northern blot analysis 

mRNAs (approximately 5 mg) were size separated by 
2 5 electrophoresis through a 6.3% formaldehyde, 1.2% agarose gel in 
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0.02 M MOPS, 0.05 M sodium acetate (pH 7.0), and 0.001 M EDTA. 
The mRNAs were then blotted to Hybond-N (Amersham) by capillary 
action in 20x SSPE. The RNAs are fixed to the membrane by baking 
for 2 hours at 80°C. Additional multiple tissue northern (MTN) blots 
5 were purchased from CLONTECH Laboratories, Inc. These blots 
include the Human MTN blot (cat.#7760-l), the Human MTN II blot 
(cat.#7759-l), the Human Fetal MTN II blot (cat.#7756-l), and the 
Human Brain MTN III blot (cat.#7750-l). The appropriate probes 
were radiolabeled utilizing the Prime-a-Gene Labelling System 
10 available from Promega (cat#U1100). The blots were probed and 
stripped according to the ExpressHyb Hybridization Solution protocol 
available from CLONTECH (cat.#8015-l or 8015-2). 

EXAMPLE 7 

1 5 Quantitative PCR 

Quantitative-PCR was performed in a reaction mixture 
consisting of cDNA derived from 50 ng of mRNA, 5 pmol of sense and 
antisense primers for TADG14 and the internal control p-tubulin, 0.2 
mmol of dNTPs, 0.5 mCi of [cc-32P]dCTP, and 0.625 U of Taq 

2 0 polymerase in lx buffer in a final volume of 25 ml. This mixture 

was subjected to 1 minute of denaturation at 95°C followed by 30 
cycles of denaturation for 30 seconds at 95 °C, 30 seconds of 
annealing at 62°C, and 1 minute of extension at 72°C with an 
additional 7 minutes of extension on the last cycle. The product was 
2 5 electrophoresed through a 2% agarose gel for separation, the gel was 



dried under vacuum and autoradiographed. The relative 
radioactivity of each band was determined by Phospholmager from 
Molecular Dynamics. 

5 

EXAMPLE 8 

The present invention describes the use of primers 
directed to conserved areas of the serine protease class to identify 

10 members of that class which are overexpressed in carcinoma. 
Several genes were identified and cloned in other tissues, but not 
previously associated with ovarian carcinoma. The present invention 
describes a novel protease identified in ovarian carcinoma. This gene 
was identified using primers to the conserved area surrounding the 

15 catalytic domain amino acid histidine and the catalytic domain amino 
acid serine which is about 150 amino acids downstream towards the 
carboxyl end. 

The gene encoding the novel extracellular serine protease 
of the present invention was identified from a group of proteases 
2 0 overexpressed in carcinoma by subcloning and sequencing the 
appropriate PCR products. An example of such a PCR reaction is 
given in Figure 1. Subcloning and sequencing of individual bands 
from such an amplification provided a basis for identifying the novel 
protease of the present invention. 

25 
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EXAMPLE 9 



The sequence determined for the catalytic domain of 
TADG-14 is presented in Figure 2 and is consistent with other serine 
5 proteases and specifically contains conserved amino acids 
appropriate for the catalytic domain of the serine protease family. 
Specific primers (20mers) derived from this sequence were used. 

A series of normal and tumors cDNAs were examined to 
determine the expression of the TADG-14 protein. In a series of 

1 0 three normals compared to nine carcinomas using p -tubulin as an 
internal control for PCR amplification, TADG-14 was significantly 
overexpressed in eight of the nine carcinomas and either was not 
detected or was detected at a very low level in normal epithelial 
tissue (Figure 3). This evaluation was extended to a standard panel 

15 of about 35 tumors. Using these specific primers, the expression of 
this gene was also examined in both tumor cell lines and other tumor 
tissues as shown in Figure 4. The expression of TADG-14 was also 
observed in breast carcinoma and colon carcinoma. TADG-14 
expression was not noted in other tissues. For example, TADG-14 

2 0 was not present in detectable levels by Northern blot analysis in any 
of the following normal tissues: fetal lung, fetal heart, fetal brain, 
fetal kidney, adult spleen, thymus, prostate, testis, ovary, small 
intestine, colon, peripheral blood leukocytes, heart, placenta, lung, 
liver, skeletal muscle, kidney, pancreas, amygdala, caudate nucleus, 
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corpus callosum, hippocampus, whole brain, substantia nigr, 
subthalamic nucleus and thalamus. 

Using the specific sequence for TADG-14 covering the full 
domain of the catalytic site as a probe for Northern blot analysis, 
5 three Northern blots were examined: one derived from ovarian 
tissues, both normal and carcinoma; one from fetal tissues; and one 
from adult normal tissues. As noted in Figure 5, abundant 
transcripts for TADG-14 were noted in ovarian carcinomas. 
Transcripts were noted in all carcinomas, but at lower levels in some 

10 sub-types of ovarian cancer. Furthermore, no transcript was 
observed from normal ovarian tissue. The transcript size was found 
to be approximately 1.4 kb. Of particular note is the fact that in the 
fetal tissue examined including brain, lung, liver, kidney and in 
multiple adult tissues examined, none of these blots showed 

15 expression for the TADG-14 transcript. The hybridization for the 
fetal and adult blots was appropriate and done with the same probe 
as with the ovarian tissue. Subsequent to this examination, it was 
confirmed that these blots contained other detectible mRNA 
transcripts 

2 0 Using the base sequence derived from the original full 

length PCR clone corresponding to nucleotides 713-1160 of the 
catalytic domain as a probe to screen libraries, an ovarian carcinoma 
library derived from ascites tumor cells was examined for the 
presence of TADG-14. Four clones were obtained, two of which 

2 5 covered the complete mRNA 1.4kb transcript of the TADG-14 gene. 
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The complete nucleotide sequence (SEQ ID No:6) is provided in Figure 
6 along with translation of the open reading frame (SEQ ID No:7). 

In the nucleotide sequence, there is a Kozak sequence 
typical of sequences upstream from the initiation site of translation. 
5 There is also a polyadenylation signal sequence and a poly-A tail. 
The open reading frame consists of a 260 amino acid sequence (SEQ 
ED No:7) which includes a secretion signal sequence in the first 25 
amino acids confirming the extracellular processing of the protease. 
Also a clear delineation of the catalytic domain conserved histidine, 

1 0 aspartic acid, serine series along with a series of amino acids 
conserved in the serine protease family is indicated. 

Examination of the databases for both the expressed tag 
sequence and complete transcripts provided seven genes that had 
significant homology to this newly identified serine protease. One 

1 5 gene was identified from mouse brain and a comparison of the 
nucleotide homology is provided in Figure 7. A comparison of the 
homology of the amino acid sequence is provided in Figure 8. 
Alignment of TADG-14 with mouse neuropsin revealed 77.2% 
similarity and 72.2% identity at the amino acid levels for these two 

2 0 genes. Given that the size of the mouse transcript is 1.4kb and that 
the mouse gene contains 260 amino acids and there is greater than 
70% homology, this gene may be a human equivalent of the mouse 
neuropsin gene or a family member of neuropsin-like genes. 

TADG-14 is secreted and expressed early in tumor 

2 5 development and has invasive capacity. TADG-14 therefore is a 
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potential diagnostic for ovarian and other cancers. TADG-14 also 
may be a target for intervention in regulating tumor spread by 
inhibition, gene therapy, antibody inactivation technology. In 
addition to its obvious usefulness in ovarian carcinoma and other 
5 carcinomas including the preliminary data on breast and prostate, 
the neuropsin-like qualities may provide an opportunity for 
usefulness in neuropathologic disorders. 

Any patents or publications mentioned in this 
specification are indicative of the levels of those skilled in the art to 
10 which the invention pertains. These patents and publications are 
herein incorporated by reference to the same extent as if each 
individual publication was specifically and individually indicated to 
be incorporated by reference. 

One skilled in the art will readily appreciate that the 
1 5 present invention is well adapted to carry out the objects and obtain 
the ends and advantages mentioned, as well as those inherent 
therein. The present examples along with the methods, procedures, 
treatments, molecules, and specific compounds described herein are 
presently representative of preferred embodiments, are exemplary, 
2 0 and are not intended as limitations on the scope of the invention. 
Changes therein and other uses will occur to those skilled in the art 
which are encompassed within the spirit of the invention as defined 
by the scope of the claims. 
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WHAT IS CLAIMED IS: 



1. DNA encoding a TADG-14 protein selected from the 
group consisting of: 

5 (a) isolated DNA which encodes a TADG-14 protein; 

(b) isolated DNA which hybridizes to isolated DNA of 
(a) above and which encodes a TADG-14 protein; and 

(c) isolated DNA differing from the isolated DNAs of (a) 
and (b) above in codon sequence due to the degeneracy of the 

1 0 genetic code, and which encodes a TADG-14 protein. 

2. The DNA of claim 1, wherein said DNA has the 
sequence shown in SEQ ID No. 6. 

15 

3. The DNA of claim 1, wherein said TADG-14 protein 
has the amino acid sequence shown in SEQ ID No. 7. 

20 

4. A vector capable of expressing the DNA of 
claim 1 adapted for expression in a recombinant cell and 
regulatory elements necessary for expression of the DNA in the 
cell. 

25 
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5. The vector of claim 4, wherein said DNA encodes a 
TADG-14 protein having the amino acid sequence shown in SEQ ID 
No. 7. 

5 

6. A host cell transfected with the vector of claim 4, 
said vector expressing a TADG-14 protein. 

10 7. The host cell of claim 6, wherein said cell is selected 

from group consisting of bacterial cells, mammalian cells, plant cells 
and insect cells. 

15 8. The host cell of claim 7, wherein said bacterial cell 

is E. coli. 

9. Isolated and purified TADG-14 protein coded for by 
2 0 DNA selected from the group consisting of: 

(a) isolated DNA which encodes a TADG-14 protein; 

(b) isolated DNA which hybridizes to isolated DNA of 
(a) above and which encodes a TADG-14 protein; and 
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(c) isolated DNA differing from the isolated DNAs of (a) 
and (b) above in codon sequence due to the degeneracy of the 
genetic code, and which encodes a TADG-14 protein. 



10. The isolated and purified TADG-14 protein of claim 
9 having the amino acid sequence shown in SEQ ID No. 7. 

11. A method of detecting expression of the protein of 
claim 1, comprising the steps of: 

(a) contacting mRNA obtained from the cell with the 
labeled hybridization probe; and 

(b) detecting hybridization of the probe with the mRNA. 



ABSTRACT OF THE DISCLOSURE 



The present invention provides a DNA encoding a TADG-14 
protein selected from the group consisting of: (a) isolated DNA which 
encodes a TADG-14 protein; (b) isolated DNA which hybridizes to 
isolated DNA of (a) above and which encodes a TADG-14 protein; and 
(c) isolated DNA differing from the isolated DNAs of (a) and (b) above 
in codon sequence due to the degeneracy of the genetic code, and 
which encodes a TADG-14 protein. Also provided is a vector capable 
of expressing the DNA of the present invention adapted for expression 
in a recombinant cell and regulatory elements necessary for 
expression of the DNA in the cell. 
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SERINE PROTEASE PRIMERS 



AS1 




AS2 



12 3 4 

1) Normal Ovary 2) Tumor 
3) Normal Ovary 4) Tumor 



Figure 1 shows a comparison of PCR 
products derived from normal and 
carcinoma cDNA as shown by staining in 
an agarose gel. Two distinct bands (lane 2) 
were present in the primer pair sense-His- 
antisense ASP-(ASl) and multiple bands of 
about 500 bp are noted in the carcinoma 
lane for the sense-His antisense-SER (AS2) 
primer pairs (lane 4). 
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F LGKHNLRQRE SSQEQSSVVR AVIHPDY . . - 

R LGDHSLQNKD GPEQEIPWQ SIPHPCY 

R LGEKNIEVLE GNEQFINAAK IIRHPQY. . . 
K LGSDTLGDRR A. .QRIKASK SFRHPGY... 
FP ERNRVLSRWR VFAGAVAQAS PHGLQLGVQA WYHGGYLPF 



WVLTAAHC <K PNLQV. 
WWTAAHCKK PKYTV. 
WWSAGHCYK SRIQV. 
WVLTAAHCfCM NEYTV. 
IWVLTAAHC 



251 

. - . DAASHDQ 
NSSDVEDKNH 
. . . DRKTLNN 
ST. . . QTHVN 
RDPNSESNSN 



DIMLLRLARP AKLSELIQPL PLERDCSA. . 
DLMLL 2LRDQ ASLGSKVKPI SLADHCTQ . . 
DIHLIKLSSR AVINARVSTI SLPTAPPA. . 
DLMLVKLNSQ ARLSSMVKKV RLPSRCEP.. 
DIALV3LSSP LPLTEYIQPV CLPAAGQALV 
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ADTCCGDSGG 
KDSCQ3DSGG 
KNACK GDSGG 
IDACQGDSSG^ 



Figure 2. 



Comparison of amino acid sequence of 
TADG-14 with known serine protease* catalytic 
domains . 



O J*. 

■a -a 




Normal 
Normal 

Normal 

Serous Carcinoma 
Serous Carcinoma 
Serous Carcinoma 
Serous Carcinoma 
Serous Carcinoma 
Serous Carcinoma 
Mucinous Carcinoma 
Endometrioid Carcinoma 
Clear Cell Carcinoma 




£ - AOBQ 10 BIUOUIOJBO UBUBAQ 



939 SMS 10 BiuouioJBO ububaq 



SSet^-ai/y-VQIAl 10 BiuoupjBo jSBajg 



IXZ-aiAI-VQIAl "10 BiuoupjBo }SB8jg 



bluouiojbq ;sB9jg 



biuouiojbo e;B)soad 



BIUOUIOJBO U0|00 



U0|00 IBUJJON 



t 



u 

•H 



Q 
< 



< 
I- 
LU 



a;Aooijnan -g d 
uo|oo 

eu!jse;u| hblus 
ajbao 
sajsai 

snuiAiji 
uaa|ds 

Aaupw 

J3AH 

Bunn 
uiBjg 





CO 
CD 

a 

CO 
CO 
•H 

4-> 

Cfl 

s 
o 
c 

•H 

o 

CO 

o 
c 

•H 

CO 
> 

o 
c 



3 

cC 



CO 

+J 

cu 

c 
o 

•H 

CO 
CO 

cu 

X 



BWOUIOJBQ I13Q JB9IQ 

Btuou|OJBO pjoujawopug 

BiuoujOJBo snoujoniAl 
BiuoupjBQ snojas 

Ajbaq ibiujon 




CO 

o 

rH 

m 

OJ 
•H 



1 CT GT AGCAGGCAGAGCTT AC CAAGT CT CT CCGAACT CAAAT GGAAGAAATACCT TAT GAA 60 
61 T GT AAGAAT GTAGGGG GT CAT GGCTT GTAATT T ACACAGT GTAAAT GAAAC CAT C CT AGA 120 
121 G GATT AT GAGGAAT C CT TT CT AT GT GAT T T T CAAT C ATAGCAAGCAAGAAAGGCT CCAGT 180 
181 GTCAAGGTAGTTCAGCTCTTACAGGATATAAAACAGTCCATACTTGAGAGAAAAAACTTA 240 
241 GAT CT GAGT GAT GGAAT GTGAAGCAAAT CTTTCAAAAT CAGTAGACATTT CTTGGACAT A 300 
301 AAACAC AGAT GAGGAAAGGGCTT CAAATTAGAAGTT ACGTAAT CAC CAT CAGAAAGT T CA 360 
361 T GTTTGGTAAATT CTGTT ACTAGAAAT GTAGGAAATT CAGGTATAGCTTTGAATCCCAAT 420 
421 TACACATT GGT CAGT GGGAAAACTAAGGGCCT CCAACAGGCAAATTCAGGGAGGATAGGT 480 
481 TTCAGGGAATGCCCTGGATTCTGGAAGAC QTCACCATGQp ACGCCCCCGACCTCGTGCGG 540 

. MGRPRPRAA- 
541 r.r. A AG A CGTGGAT GTTCCTGCTCTTGCT GGGGGGAGCCTGGGCAGGACACTCCAGGGCAC 600 

K T W M |f L L L L 1 G G A W A G H S R A 0 - 
601 AGGAGGACAAGGTGCTGGGGGGT CAT GAGT GCCAACCCCATTCGCAGCC^TGG^GGCGG^ 660 

EDKVLGGHECQPHSQPWQAA- 
661 CCTTGTTCCAGGGCCAGCAACTACTCTGTGGCGGTGTCCTTGTAGGTGGCAACTGGGTCC 720 

L F O G Q QLLCGGVLVGGNWVL- 
721 TTACAGCTGCCCACTI STAAAAAACCGAAATACACAGTACGCCTGGGAGACCACAGCCTAC 780 

T A A H+C KKPKYTVRLGDHSL Q- 
781 AGAATAAAGAT GGC CC AGAGCAAGAAATAC CT GT GGTT CAGT CCAT CC CAC ACC C CT GCT 840 

NKDGPEQEIP V V 0 S I P H P C Y - 
841 A CAACAGC AG C GAT GT GGAGGAC CACAAC CAl fcAT CT GAT GCT T c| r T CAACT GCGT GACC 900 

IN S si DVEDHNHD+LMLLQLRDQ- 
901 AGGCATCCCTGGGGTCCAAAGTGAAGCCCATCAGCCTGGCAGATCATTGCACCCAGCCTG 960 

AS LGS KVKPI SLADHCTQPG- 
9 61 GCCAGAAGTGCACCGTCTCAGGCTGGGGCACTGT CAC CAGT CCCCGAGAGAATTTTCCTG 1020 

QKCTVS GWGTVTS PREN FP D - 
1021 AC ACT CT CAACT GT GCAGAAGTAAAAAT CTTT CC C CAGAAGAAGT GT GAGGAT GCT T ACC 1080 

TLNCAEVKI FPQKKCEDAYP- 
1081 C GGGGCAGAT CACAGAT GGCAT GGT CTGTGCAGGCAGCAGCAAAGGGGCTGACACGTGCC 1140 

G O I T D G M VCAGSSKGa(5)tCQ- 

1141 1agggcgattctggaggcccc| :tggtgtgtgatggtgcactccagggcatcacatcctggg 1200 

G D @G G P L VCDGALQGITSW (5)- 
1201 GCTCAGACCCCTGTGGGAGGTCCGACAAACCTGGCGTCTATACCAACATCTGCCGCTACC 1260 

sdpcgrsdkp(g)vytnicryl- 

1261 T GGACT GGAT CAAGAAGAT CAT AGGCAGCAAGGGCT GAT T CTAGGATAAGC ACT AGAT CT 1320 

DWIKKI IGSKG* <£Q ^ 7 
1321 CCCTT AATAA7\p TCACGGAATTC 5£m ^ njO; (p 

I i = Kozak's Consensus sequence 

+ = Conserved amino acids of catalytic triad H, D, S 
| nss 1 = Possible N - linked glycosylation site 

= Poly - adenylation signal 

t I = Conserved nt of catalytic triad 

0 = aa required for formation of an oxyanion hole for catalytic activity 

| flll | = Secretion signal sequence 



Figure 6. Complete sequence of TADG-14 transcript including 
ORF and common domains. 



Figure 7. Homology of TADG-14 with 
mouse neuropsin. 76% identity for ORF. 
Low homology outside of ORF. 
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Figure 8. Amino acid homology of TADG-14 with mouse neuropsin, 
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